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1 

CHIRAL SYNTHESIS WITH MODIFIED ENZYMES. 



This invention relates to chiral synthesis; more 
particularly, it relates to the modification of enzymes to 
facilitate such synthesis. 

Enzymes are biological catalysts which are specific 
both in terms of chemical activity and substrate structure, and 
it is this specificity which has been exploited in a variety of 
commercial applications. Although many such activities are 
known, it may be desirable to change the range of substrates that 
are suitable for catalysis and/or to change the efficiency of a 
given catalysis for a particular type of enzyme.. Given a type 
of enzyme with known key elements vis-a-vis substrate preference 
and hence activity, it may be .possible purposefully to change 
those elements to bring about desired modifications and hence to 
expand the potential industrial utility of a particular enzyme. 

Enzyme activity is primarily controlled by the amino 
acid composition especially in certain important functional areas 
of the enzyme, altering these amino acids is known to change 
activity and may be achieved by the use of either specific or 
non-specific techniques. For example, the introduction of a 
neutralising amino acid may facilitate the catalysis of a 
substrate with an altered charge and this could be regarded as 
a predictable alteration, although no result may ever be 
predicted with total certainty, especially where the tertiary 
structures of enzymes are not as precisely known as would be 
necessary for complete confidence. However, while it is possible 
to make individual changes by known means, this would prove an 
almost infinite task and so it is often convenient initially to 
make a "macro-change" and then to "fine tune" with discrete 
changes. of course, in a given case, a macro-change may prove 
to be sufficient, or, indeed, discrete changes may be all that 
are required. 



wo 93/15208 



PCr/GB93/00204 



2 

Although alteration of the enzyme structure has been 
described, this is not achieved by any direct effect on the amino 
acid components, but by known techniques on the DNA encoding for 
the enzyme prior to protein transcription. Talcing as an example 
the enzyme lactate dehydrogenase (natural substrate pyruvate) , 
when acting on the carboxylic acid analogue of pyruvate, oxalo 
acetic acid, it would have substantially reduced activity due to 
the negative charge introduced into the active site. In this 
case, site-directed mutagenesis involving the introduction of a 
neutralizing charge into the correct region of the active site 
alters substrate specificity allowing the enzyme to take on the 
activity that would be expected of a malate dehydrogenase. Such 
specific mutations may be considered predictable in gross terms, 
but are very unlikely to be the ultimate refinement in increasing 
specificity towards such a substrate. For alternative 
substrates, such as those with increased alkyl chain lengths, 
phenyl residues or heterocyclic additions, predictions of site- 
specific changes are xxnlikely to be reliable. It is probable 
that the changes necessary to accommodate such "unnatural 
substrates" are most likely to be required adjacent to or in the 
active site region of the enzyme, which in many enzymes may 
involve up to 20 amino acids, which may be derived from many 
disparate parts of the primary sequence. Clearly,, if one tried 
to proceed by alterations in individual amino acids, the scale 
of the undertaking would be impractical even with modern 
techniques . 

In order to achieve the desired objective while 
circumventing the above disadvantages, it is possible in the case 
of lactate dehydrogenase, for example, to make use of the known 
loop region forming part of the active site. As a convenient 
first step, at least a portion of the loop region may be 
exchanged for a larger or smaller section of loop region from a 
similar enzyme. This may be expected to allow some variation in 
substrate specificity and relative catalytic efficiency, while 
retaining the typical activity. Having chosen the most promising 
loop region for a desired substrate, which could indeed be the 



wo 93/15208 



PCT/GB93/00204 



starting wild-type loop, specific amino acid residues may be 
targeted for further change. In order to secure the best 
possible option, it is necessary to survey all possible amino 
acid combinations in the positions of interest- This is done by 
generating random nucleotides in the region coding for the amino 
acids targeted. Following routine cloning, it becomes necessary 
to select for a desired modification from amongst the numerous 
alternatives produced. Such screens are in common use. This 
approach to enzyme engineering is facilitated by the introduction 
of unique endonuclease restriction sites into the coding DNA, if 
such are not already present, at desired points. Such changes 
may often be achieved by alteration in the bases without altering 
the amino acid encoded due to code degeneracy or alternatively 
they are achieved by the introduction of codes as far as possible 
for similar amino acids. This allows the region of particular 
interest to be handled independently of the remainder. 

As will be appreciated from the foregoing, the present 
invention relates to a method for modifying the specificity 
and/or efficiency of an enzyme, while retaining its catalytic 
activity, characterised in that it comprises: selecting an 
enzyme, the tertiary structure of which is substantially known 
or deduced; identifying at least one specificity and/or 
efficiency-related region; identifying or constructing unique 
restriction sites bounding the identified region in the DNA 
coding therefor; generating a DNA sequence which corresponds to 
at least a portion of the identified region, except that the 
nucleotides of . at least one codon are randomized, or selecting 
as a substitute for at least a portion of the identified region 
an alternative such region, which may itself be similarly 
randomized; using the generated or substitute DNA sequence to 
replace the original such sequence; expressing the DNA including 
the generated or substitute DNA sequence; and selecting for a 
desired modification so that the DNA coding therefor may be 
isolated. 



It will be described in more detail below, but the 
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present method may be illustrated by reference to a 
dehydrogenase, in particular an ct-hydroxy acid dehydrogenase, 
such as lactate dehydrogenase. In this illustration, it is the 
loop region of the enzyme which is identified initially as being 
specificity and/or efficiency-related. Generally, the randomized 
DNA is generated by means of an inosine triphosphate PGR method 
or a spiked oligonucleotide method or a PGR assembly method, all 
of which will be discussed in more detail below. If a substitute 
is to be selected for at least a portion of the region of 
interest, it is often based on a corresponding sequence from a 
similar enzyme. Once the original DNA sequence has been replaced 
by the generated or substitute DNA sequence, it is cloned into 
a plasmid or phage vector and transformed into a bacterium or 
virus for expression. Thereafter, a screen may be used to select 
for a desired modification. Taking L-lactate dehydrogenase as 
an example, positions 101 and 102 are particularly appropriate 
for randomization. 



The present invention also relates to the use of such 

^?:\?y?iies particularly in the production of chiral 

products. Often, such processes involve the use of a cof actor 
recycling system. One example is the reduction of 2-oxo-4- 
phenyl-propanoic acid characterised in that it comprises the use 
of L-lactate dehydrogenase which has been modified in the loop 
region by the present method and another is the reduction of 4- 
methyl-2-oxo-3-pentenoic acid characterised in that it comprises 
the use of MVS/GG obtainable by the present method. 

Having outlined the present invention, it will now be 
described more fully. 

The use of enzymes in chemical synthesis has gained 
increasing acceptance as an academic possibility, while its 
introduction into industrial chemical procedures is rare. The 
potential advantages of enzymes as catalysts, such as obtaining, 
stereospecificity and regiospecif icity under mild conditions, 
have initiated many attempts to obtain enzymes suitable for 
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particular chemical conversions. 

Several approaches to selection of the enzyme are 
possible. Experimentation with currently-available enzymes may 
yield surprising results in terms of breadth of substrate 
specificity not predictable from the literature. It is thus 
possible to utilise commercially-available enzymes, which may 
have a low catalytic efficiency, but, because of cost, may form 
the basis of an industrial process. A second approach is to 
screen large numbers of environmental micro-organisms in an 
attempt to effect a particular transformation. Should such an 
activity be obtained, it is often required that the enzyme be 
obtained in a purer form than whole microbial cells or crude 
preparations thereof. To obtain enzymes from such a screen in 
sufficient quantity and at a reasonable cost for an industrial 
process requires extensive development often with the involvement 
of cloning and over-expression of the gene. Another approach for 
obtaining suitable enzyme catalysts is to modify the structure 
of an existing enzyme to improve its catalysis for a particular 
substrate. This approach of so-called "enzyme engineering", 
which is in its very early stages has great potential for the 
preparation of catalysts for the synthesis of homochiral 
molecules. The importance of these molecules in the synthesis 
of single isomer pharmaceuticals and agrochemicals is well 
recognised. 

Despite the obvious attraction of enzyme engineering, 
the results of amino acid changes are often, at best, only of 
limited predictability due to the structural complexity of 
enzymes. At present, it is not possible to predict the effect 
of certain amino acid changes on the finer points of substrate 
recognition and catalytic performance where the substrate is 
altered in size and additional functionalities introduced from 
the natural substrate. It is generally easy to predict the 
removal of activity by the elimination of one of the 
catalytically-vital amino acids which are generally well known 
from the classical studies of enzyme mechanism and function. To 
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enhance the activity towards an unnatural substrate remains a 
challenge. 

The opportunity for enzyme engineering may be 
calculated for a 300 residue protein of 20 amino acids as lo^^^ 
possible sequences. The vast majority of these sequences cannot 
have been explored for biological function. It may be suggested 
that a typical large protein of 3 00 amino acids residues cannot 
represent a global optimum for any biological function, but at 
best is an assembly of empirically optimised 25-3 5 amino acid 
domains. Thus, enzyme engineering should be capable of improving 
a large frame-work for any particular target function. 

Recently, methods have been developed to express random 
15 sequences of DNA as protein fused to phage M13 coat protein and 
it has been suggested that it will be possible to mimic the 
process of evolution by suitable affinity chromatography to 
isolate both the required protein sequences and its gene (Kang, 
PNAS, M/ 1991, 4363). However, just as evolution has been 
20 unable to sample all possible sequences, so too the protein 
engineer will be limited to the number of M13 phage that may be 
screened (10^^ plaque-forming units are produced per litre 
culture of coli cells containing the phage M13) . With 10^^ 
variants, the length of DNA which may be optimised is obtained 
25 from 4" = 10^^, i.e. N = 24 bases or 8 amino acids. The other 
problem encountered is that a phage display system determines 
binding not catalysis and thus is not designed to obtain enzymes 
with new chemical potential. 

•^^ Random mutagenesis of existing proteins is also limited 

in its ability to produce radically altered proteins by problems 
of sampling all the possible variants. In addition, the genetic 
code is very resistant to change. Not only are codons redundant 
at the third position, but also amino acid residues with similar 

35 properties are coded by similar sequences and thus resistant to 
sparse mutagenesis. For example: (i) a codon having a T at the 
second position always codes for an amino acid residue having a 
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hydrophobic side chain; (ii) the codons for aspartate and 
glutamate differ only at the third position. Therefore, 
strategies, such as use of thioate nucleotides (Holm, Prot Eng, 
1, 1990, 181), which create randomly dispersed mutations (in 
which only one mutation is likely to be present in any codon) are 
unlikely to yield new proteins having dramatically different 
properties to those of the parent proteins. 

Although it should be possible to engineer any designed 
property into any protein framework, only those which have been 
well characterised are likely to be redesigned successfully. 

In order to obtain the fundamental knowledge required 
for rational redesign, a combination of crystallography, site- 
directed mutagenesis and transient kinetic techniques was used 
to relate function to structure in the NAD-dependent lactate 
dehydrogenases from both prokaryotes and eukaryotes* That 
knowledge not only revealed those amino acids required for the 
catalytic pathway, but also mapped those amino acids which are 
part of a major rearrangement of shape which is induced when the 
negatively-charged substrate acid enters the active site and 
causes the protein to sequester the substrate in an internal 
vacuole which is sensitive to the size of the substrate and which 
contains exactly balanced charge. Using this knowledge, it has 
been possible to design specific new enzymatic properties with 
respect to charged substrates and so avoid the low statistical 
probabilities associated with random mutagenesis. It should, of 
course, be appreciated that the present invention is more 
generally applicable than to this particular illustration. 

Accordingly Fig. l depicts the active site of lactate 
dehydrogenase. In this illustration, some of the residues which 
determine substrate specificity are carried on the under-surf ace 
of the "upper jaw". The rate-limiting step in lactate 
dehydrogenase catalysis is the rate at which this loop may sweep 
through a viscous solvent to close onto the upper surface of 
helix a2G. The rate-limiting step is largely independent of the 
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sequence of amino acids on the "upper jaw" and since the chemical 
step is much faster than the shape change, the lactate 
dehydrogenase system has the advantage that the loop sequence may 
be easily varied to achieve different substrate specificities 
5 without much danger that the chemical step will become rate- 
limiting. Thus, in order to obtain enzymes improved by 
engineering towards particular substrates,, a combination of 
techniques may be preferentially employed. Specific residues may 
be changed to accommodate functional groups, such as an altered 
10 charge to that of the natural substrate, but to perfect the 
enzyme for activity towards a different substrate, elements of 
the infinite variability of random amino acid changes may be 
required. This may be applied to a particular area of the enzyme 
and selected for using screening techniques. 

15 

An object of the present invention was to modify an 
already useful, but substrate-restricted enzyme, S lactate 
dehydrogenase, to provide an improved catalyst for reduction of 
the a-keto group in acids larger than the natural substrate, 
20 pyruvate. In particular, the substrates of interest contain 
bulky aromatic groups. 

The natural enzyme used as the basis for engineering 
25 was the thermophilic lactate dehydrogenase (LDH) isolated from 
Bacillus stearothermophilus . which has been cloned and expressed 
in Escherichia coli- 

This enzyme has been one of the most thoroughly 
3 0 characterised protein frameworks (Dunn, C. R. , et al, Philos. 
Trans. R. Soc. London Ser. B, 1991, 332, 184), including the 
study of inhibition, substrate interaction and genetic 
manipulation. The physical stability of the enzyme, especially 
to thermal denaturation, makes it an ideal candidate for 
3 5 demonstrating the features of redesign which would be generally 
applicable to a-hydroxy acid dehydrogenases, for example. 
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The modification of wild-type enzymes presents a 
significant challenge because, even in the case of a protein with 
considerable literature knowledge, the results may be unexpected 
and surprising. Thus, redesign of even well- studied enzymes is 
of limited predictability. 

Changes in the amino acid composition of enzymes and 
thus effects on kinetics and substrate specificity have occurred 
throughout nature and various methods have been developed in 
order to potentiate the natural divergence of enzyme structure. 
Random mutations may be produced in genetic information (and thus 
in the protein coded for) by the use of classical mutagenesis. 
Lately, the technique of site directed mutagenesis has allowed 
the alteration of specific bases in genes, thus producing 
directed amino acid changes in the target protein at a known 
position. Using similar techniques, it has been possible to 
achieve the replacement of significsint amino acid sequences in 
a functionally important area of the enzyme. 

Detailed knowledge of the protein, such as primary 
sequence and tertiary structure from X-ray analysis, along with 
molecular modelling allow the identification of the position of 
various amino acids in what are known as conserved regions. This 
is illustrated with the nomenclature of the amino acids of 
various lactate dehydrogenase enzymes. Thus, any structure in 
the protein which is retained between species is regarded as 
conserved and probably essential for the enzyme's function. This 
information will allow any change in a particular enzyme to be 
pinpointed for all other homologous enzymes across all general 
substrate, types; if this were not possible the enzymes would not 
fulfil the same biochemical function. The enzymes of particular 
interest at present are a-hydroxy acid dehydrogenases, which 
catalyse the NADH/NADPH dependent reduction of a keto group in 
an a-position to a carboxylic acid, or, alternatively, the 
reverse reaction where the a-hydroxy group is oxidised to the 
ketone. 
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Attempts to modify the enzyme lactate dehydrogenase to 
expand the natural substrate specificity to allow an increased 
reaction rate with larger substrates with various functional 
groups has led to the present unpredictable observations. 
Although it may be possible to prepare substrates and 
corresponding chiral products of interest by chemical synthesis, 
followed by wild-type enzyme reduction, such an approach may not 
be attractive and it may be that preparation via a redesigned 
protein framework may provide a more rational and cost effective 
approach. Additionally, the alteration of the enzyme has 
demonstrated that the activity towards the natural substrate may 
be so dramatically reduced that completely different substrate 
selectivity is produced. This may not be a requirement of a 
biotransformation catalyst, where the enzyme is presented with 
only one substrate species for reduction, but, when a mixture of 
potential substrates is present, such as may occur in a 
biological sample, this may be essential for achievement of 
selective conversion or the determination of one particular 
chemical species. This alteration in substrate specificity could 
also be advantageous in a biotransformation using whole cells 
where the intended substrate is necessarily contaminated with 
other entities which could also be transformed. 



In the work of Wilks et al (Biochemistry, 1990, 27., 
8587) a mutation strategy is described for the production of NAD- 
dependent dehydrogenases which have altered substrate 
specificity. The disclosed enzymes catalyse the reduction of 
homologues of pyruvic acid corresponding to the general formula: 
*^nK2n+i COOH, which may include straight- and branched-chain 
alkyl residues. The initial intention of the present work was 
to continue the design method for substrates with an aromatic 
function, in addition to extended alkyl residues and hydroxy 1 and 
keto substitution associated with the same base structure of a- 
oxoacids. 

Enzymes capable of reducing such substrates would be 
of particular value in the field of synthetic chemistry where an 
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a-keto compound could be converted stereospecif ically to the 
corresponding secondary alcohol. The production of individual 
optical isomers of secondary alcohols is especially valuable in 
the manufacture of optical isomers of pharmaceuticals and drug 
intermediaries. The feature of thermophilicity which may be 
obtained with some a-hydroxy acid dehydrogenases is valuable as 
it enables the enzymic reactions to be carried out at relatively 
high temperature where a rate acceleration may exist and the 
enzymes are inherently stable. These enzymes may also be 
suitable for incorporation into determinations of the levels of 
particular substrates obtained in biological samples under 
certain disease states. 

A numbering convention has evolved in the field of 
NAD-dependent dehydrogenases, which was originally based on an 
X-ray structure of dogfish muscle lactate dehydrogenase. This 
system numbers amino acids in ascending order extending from the 
N terainus. This system identifies conserved residues, such as 
glycine at positions 30 and 33, tyrosine at position 85, arginine 
at position 109, serine at position 163 and aspartic acid at 
position 168. 

Thus, in any given NAD dependant dehydrogenase, natural 
or subject to mutation, there are regions of sequence which are 
homologous with the amino acid sequence of the numbering 
convention. An important aspect of this convention is that any 
amino acid change in an NAD dependent dehydrogenase may be 
accurately described. 

In Table 1 below, an alignment of amino acid sequences 
is shown for three NAD dependent lactate dehydrogenases: the M4 
isoenzyme of pig, the testis isoenzyme of man and the Bacillus 
stearothermoDhi lus enzyme. (The symbols " - " do not signify 
breaks in the continuous polypeptide chains, instead they are 
conventional representation of discontinuities of numbering which 
allow alignment with sequences of other enzymes to give maximum, 
homology. ) 
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Table 1 

1 1 2 2 2 2 3 

^ 5 0 5 0345 0 

5ATLKEKLIAPVAQQETTIPNNKITVVGVG-QVGM 
STVKEQLIEKLIEDDE--SQCKITIVGTG-AVGM 

MKNNGGARVVVIGAG-FVGA 



10 



15 



3 
5 

A C A 
A C A 
S Y V 



4 
0 
I 
I 
L 



M N 



4 
5 

- S - L 
" D - L 

- G - I 



5 5 

0 5 

TDELALVDVL EDK 

ADELALVDVA LDK 

ADEIVLIDAN ESK 



20 



6 
0 

L K 
L K 
A I 



M M 
M M 
A M 



6 
5 
D 
D 
D 



H G 
H G 
H G 



K V 



7 
5 

F L Q T P 
F F S T S 
F A P K P 



K I 
K V 
V D 



8 
0 

V A N K D 
T S G K D 
I W H 



G D 



8 
5 

y 

Y 
Y 



S V T A 
S V S A 
D D C R 



9 
0 
N 
N 
D 



25 



S 
S 
A 



9 
5 

V V 
I V 

V I 



1 

0 
0 

V R 
A R 



1 
0 
5 

Q E 
Q E 



E S 
E T 



1 
1 
0 

R L N L V 
R L A L V 



1 1 
1 2 
5 0 

R N V N V F K 

R N V A I M K 



30 



ANQKPGETRLDLVDKNIAIFRSI 



35 



1 
2 
5 
P 
P 
E 



V K 

V H 



V M A 



1 
3 
0 
Y 
Y 
S 



3 
2 
A 
P 
P 
F 



N 
D 
Q 



1 
3 
5 
I 
I 
F 



V V 

V V 

V A 



V 
V 
V 



1 
4 
5 
L 
L 
L 



1 
5 
0 

W K 
W K 



T.W K 



40 



45 



1 
5 
5 
L 
L 
L 



1 
6 
0 

H R V 
T R V 
E R V 



1 
6 
5 
C 
C 
T 



1 
7 
0 
A 
A 
A 



1 
7 
5 
L 
L 
L 



1 
8 
0 
L 
h 
F 



11 1 
8 9 9 

50 5 0 5 

PSSCHGWIL-GEHGD 
PTSCHGWII-GEHGD 
PQNVHAYII-GEHGD 



2 2 1 
0 0 9 9 9 9*0 

0 5 A B C D A 
SSVAVWSGVNVAGVS-L 
SSVPLWSGVNVAGVA-L 
TELPVWSQAYIGVMP-I 



55 
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Table 1 rcont^ 

1 2 .:. 2 2 2 2 

0 1 2 2 



2 

3 3 4 



10 



2 5 ---0 5 0 5 0 

QQLNPEMG JtD NDSENWKEVH KhM VVESAYEVIKL 

KTLDPKLGTDSDKEHWKNIHKQVIQSAYEIIKL 
RKLVESKGEEAQKD-LERIF V 'N-V RDAAYQII 



E K 



2 2 2 2 2 2 

4 5 5 6 6 7 

5 0 5 0 5 0 
15K-GYTN-WAIGLSVADLIESMLKN--LSRIHPV 

K-GYTS-WAIGLSVMDLVP--LKN--LRRVHPV 
K-GATY-YGIAMGLARVTRAILHN--ENAILTV 

20 2 2 2 



7 8 8 9 

5 0 5 0 



2 2 2 3 

9 9 0 

fJUvS^ MYGIENEVFLSLPCVLNARGL?S 

.ciclvyn^ L^GIKEELFLSIPCVLGRNGVSD 

25SAYLDG LYGERD-VYIGVPAVINRNGIRE 



n ? ^ ^ ^ 3 3 3 

° 1.1 2 2 003 



30 5 0 5 



0 5 A B 1 



VINQKLKDDEVAQLKNSADTLWGIQKDLKDL 
-VVKIDLSEEE-ALLKKSAETLWNIQKNLI-F 
-VIEIELNDDEKNRFHHSAATLKSVLARAFTR 

Expression Cloning of human testis-specif ic lactate dehydrogenase cDNA. 
Millan, J.L., Driscoll, C.E. and Goldberg E. 

Sequence from cDNA - Genbank accession number J02938 (1986). 
" Sclir. s\X^.y.^°LH^^L ther^cphiU. l.=t,te dehyarogonase tron 
lZV;°lk,°-d,l\T!'%.%t' ""l""*. J- J- .nd. Atkinson, T. 
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Within the conventional numbering system are short 
sequences which may be correlated with specific structural 
elements in the folded polypeptide and which may have specific 
functional properties such as the substrate recognition site or 
the activation site. 

The substrate recognition site is carried in part by 
a mobile loop of polypeptide chain, conventionally numbered 98 
to 110. This sequence is contiguous but traditionally omits a 
residue 103. 

It is known for a-hydroxy acid dehydrogenases of the 
L type which generate S stereochemistry on reduction to the 
hydroxy function that a mobile surface loop exists which changes 
conformation after substrate binding. This loop consists of the 
amino acid residues 98-110 and contains an arginine at position 
109 which is important for* catalysis as the positive charge from 
the amidine group stabilises the stretched substrate carbonyl and 
thus decreases the energy required to obtain the transition state 
necessary fpr hydride transfer. 

The loop region is also involved in substrate selection 
and for that reason was the particular object for the present 
enzyme engineering study. 

The mechanism by which lactate dehydrogenase 
distinguishes different substrates is the ability of the 
substrate to fit into a proton-impermeable, fixed-sized internal 
vacuole which is formed when the mobile surface polypeptide loop 
closes down onto the protein surface. Not only is loop closure 
only possible over suitably small and singly negatively charged 
substrates, but also the loop closure triggers catalysis through 
the arginine 109 residue. The variation in composition and 
length of this mobile loop region is the immediate object. For 
the convenience of these experiments, a particular gene for wild- 
type Bacillus stearothermophilus lactate dehydrogenase was chosen 
where the amino acids alanine at positions 235 and 23 6 had been 
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changed for glycines. The effects of this particular amino acid 
substitution have been presented by Wilks et al. for a limited 
range of substrates (Biochemistry, 28, 8587) and generally 
increased the activity towards substrates with larger alkyl 
groups. Although used to demonstrate the principle of loop 
exchange, the technique would not be constrained to this 
particular enzyme, rather it is applicable not only to this 
mutant enzyme, but also to all other structurally-related a- 
hydroxy acid dehydrogenases, for example. 

The mutation where alanines at 235,23 6 are replaced by 
glycines has been combined with three mutations in the mobile 
polypeptide loop (residues 98-112), namely glycine 102 by 
methionine, lysine 103 by valine' and proline 105 by serine 
(MVS/GG) . 

This new enzyme construction was evaluated for activity 
towards longer substrates, in particular an unsaturated branched 
substrate 4-methyl-2-oxo-3-pentenoio acid, which is reduced to 
the following alcohol: 




C02Na 

Steady state kinetic measurements indicated that 
reduction of this compound by the wild-type enzyme proceeded 
slowly, obtaining an estimate for turnover of 0.033"^ in contrast 
to that obtained with the mutant enzyme of 1.2 3-^. The Km 
determined under similar conditions of substrate concentration 
(l-20mM) in the presence of 5mM fructose 1 , 6-bisphosphate was 
22mM. This observation regarding the specificity alteration 
towards a less flexible substrate indicates that the loop region 
has importance in substrate reduction. 

The method used to make new loop constructions was to 
insert restriction enzyme sites at either end of the DNA coding 
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for the loop region. These new restriction sites which are 
unique within the DNA coding for the enzyine, are cleaved and then 
religated with synthetic DNA designed to code for the required 
new loop region. One of the restriction sites introduced was for 
5 SacII near amino acid 97. The construction of the Sac ll 
restriction site required that the wild type coding sequence for 
cysteine 97 was changed to threonine. The Xbal site retained the 
wild-type amino acid sequence with arginine at 109, but did 
result in the creation of an Mlul site close to threonine 108. 
10 The new Klul site was used to advantage as it was destroyed in 
transf ormants and thus enabled easy distinction thereof from the 
wild- type gene. 

To illustrate the utility of the loop design approach 
15 to enzyme engineering, novel loops were introduced, two shorter 
by 3 amino acids and one— lofTger by 4 amino acids. The new 
enzymes generated in this manner were evaluated against a range 
of experimental substrates to determine the effect of the loop 
exchanges . 

20 

It was clearly demonstrated that the new loops altered 
the properties of the enzyme from that of the framework used in 
the construction thereof. The results also illustrate the 
difference obtained with the alanine glycine alteration at 
25 amino acids 235 and 23 6 and the introduction of the threonine in 
place of cysteine at amino acid 97. 

The increase in turnover of a-ketocaproate and a- 
ketoisocaproate With the alanine glycine double mutation was 
3 0 consistent with the results of Wilks et al. (Biochemistry, 29., 
1990, 8587) . The increase in turnover for the aromatic substrate 
2-OXO-4 -phenyl propanoic acid: 
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0 

CO2H 

along with increases in the Km for both was not obvious and 
indicates useful improvement with respect to the use of the 
mutant enzyme in the synthesis of the chiral a-hydroxy group of 
this aromatic substrate. 

The exchange of threonine for cysteine at amino acid 
97 maintained the beneficial Km effect for 2 -0x0-4 -phenyl 
butanoic acid: 

O 

^C02H over the wild-type enzyme. 

The effect of these individual mutations on the 
reduction of the aromatic substrates is of clear interest as the 
hoTnochiral hydroxyacids produced form useful chiral building 
blocks for the synthesis of bioactive compounds. 

The introduction of the new loop sequences further 
alters the substrate specificity of the enzyme reducing the 
turnover of the natural substrate from that of the wild type 
enzyme. The three new loop enzymes retained most of the wild 
type catalytic potential towards the 2-oxo-4-phenyl propanoic 
acid as shown by turnover and Km and, in the example of the 
longer loop and second shorter loop version, resulted in an 
increase in turnover. 

These examples serve to illustrate that the activity 
of the enzyme may be dramatically altered by changes in the loop 
sequences, both towards the natural substrate and larger 
unnatural substrates. 




In the large loop, it is observed that the Kcat/Km for 
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2-oxo-4"phenyl propanoic acid was 17 00 times better than for 
pyruvate compared to the wild type enzyme which is conversely 23 0 
times better for pyruvate, representing a switch in specificity 
of 391,000 fold. 

5 

The alteration in specificity of the enzyme from 
pyruvate to 2-oxo-4-phenyl propanoic acid renders the new enzyme 
suitable for the determination of the concentration of 2-oxo-4- 
phenyl propanoic acid, often termed phenyl pyruvate in clinical 
10 chemistry nomenclature, especially from body fluids, such as 
blood and urine. 



Phenyl pyruvate levels are normally low, but rise to 
significant levels with the increase in phenylalanine 

15 concentration, which is associated with the genetic disease 
phenylketonuria (Langenbeck et al., J. Inher. Metab, Dis., 4, 
1981, 69) . It is also possible that the phenyl pyruvate 
reductase or phenyl lactate dehydrogenase enzyme could be used 
in conjugation with phenylalanine dehydrogenase, a current method 

20 of determining the phenylketonuria level such that interference 
from phenyl pyruvate could be negated, thereby enhancing the 
sensitivity of the 'phenylalanine-based method. 

The construct having the restriction sites at either 
25 end of the loop region may be used to produce a series of 
dehydrogenases having loops of variable length and variable 
sequence. Thus, by restricting random mutagenesis to the region 
of lactate dehyxirogenase which has been identified as being 
important for substrate recognition, it is possible to isolate 
30 enzymes which may carry out a desired chiral reduction. The 
random mutagenesis may be generated by use of spiked 
oligonucleotides at specific positions and on different length 
loops or, alternatively, by the incorporation of inosine 
triphosphate in a polymerase chain reaction (PGR) that randomises 
35 either the entire loop region or specific residues. Both of 
these techniques have been employed to prepare mutant libraries 
using the restriction sites engineered into the DNA coding for 



"^OSmslOB PCr/GB93/00204 

19 

the loop region of LDH. A further PGR method was used to 
generate a random combinational DNA library of specific positions 
of the loop region. This technique was specifically targeted to 
positions 101 and 102 as these are involved in defining enzvTne 
substrate specificity. 

The PGR was initially used to generate 300 & 800 base 
pair fragments that had complementary overlapping ends. These 
primary products which had random sequences incorporated in the 
overlap, were then primed on each other and extended to yield an 
LDH hybrid gene. A second PGR with two outer primers annealing 
at non-overlapping ends was finally used to amplify the LDH 
product . 

Previous manipulation of the Bacin,.^ 
stearothermnr hilu.^ LDH gene involved cloning an EcoRI/PstI 
digested gene in to PKK 233-2, or M13 plasmid vectors. Where 
as now, it is possible to clone the PGR product into any one of 
a number of vectors, because one of the outer primers (2) , which 
anneals past the coding region, was designed with an additional 
EcoBI site incorporated. For example, in order to verify that 
there is a representative library with random sequences in the 
desired positions, it is possible to clone the gene with unique 
EcoRI sites into PUC18, which produces a high yield of DNA from 
mmi-preps, and subsequently the PGR product may be cloned into 
plasmid. or phage expression vectors, such as PKK 233-2. (See 
accompanying illustrative Fig. 2.) 

The following advantages are obtained with the PCR 

method: 

1. High yield of PGR product obtained. 

2. The ability to identify product as mutant DNA and select 
against wild-type sequences via Mlul digestion. 

3. Ease of handling and monitoring a ikb product compared to 
previous attempts which involved designing restriction sites 
either side of the loop region, such that a 40 base pair wild- 
type sequence may be replaced with a mutant sequence. 
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4. Speed of method. 

5. The design of primer 2 with an EcoRI site enables the 
cloning of gene product into a number of vectors. 

6. Use of double-stranded template for mutagenesis. 

7. Application of method to manipulate other areas of the LDH 
gene and the ease by which interesting mutations in different 
regions may be brought together in one molecule using this s.£li£e 
overlap extension method. 

8. Having mutant oligos with a high region of complementarity 
to the template at the 3 '-end ensures that annealing of oligos 
to the vector is highly efficient. 

In order successfully to utilise a directed random 
mutagenesis method that generates a library of mutants covering 
the loop region of the enzyme, or indeed any specific region of 
any target enzyme., requires a suitable screen for clones which 
express mutant enzymes of the desired specificity. For the 
dehydrogenases, this is simply provided by coupling NADH 
production with phenazine metasulphate to formation of insoluble 
blue formazan dye. 

The screen is based on the work of Katzen and Schimkel 
(PNAS, 54, 1218) and relies on the ability of a colony expressing 
an enzyme with specificity to oxidise the required substrate and 
to reduce NAD'*' to NADH. The reduced coenzyme then reduces 
phenazine metasulphate which in turn reduces nitroblue 
tetrazolium to form an insoluble blue dye. 

The mutant DNA is transformed into competent coli 
cells and is stored on agar plates containing 15% glycerol and 
ampicillin at -80<>C. obtaining electro-competent cells with high 
transformation rates has produced rates of 10^ per of DNA, a 
rate which produces a sufficiently representative population of 
mutant colonies for screening. Copies of this- plate are made 
using a velvet replicator and the copies grown up overnight. 
(The coli LDH activity is removed by incubation of the filter 
paper at 67 *c for 3 0 minutes, the activity of the wild-type 
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enzyine is not lost until 4 5 minutes at this temperature.) The 
copies are then screened against a range of substrates and 
individual colonies may be compared. Each master plate is 
screened at least three times to ensure conditions are ideal in 
each case* 

Using this technique demonstrates differential rates 
of staining have been shown between filter copies of wild-type 
colonies and those containing the malate dehydrogenase activity 
mutant enzyme (Q102R) with lactate and malate as substrates, 
respectively, conf inning the validity of the screen to identify 
individual colonies. 

The following illustrates the present invention: 
Mutagen esis of lactate dehydrogenase 

Mutantfer of lactate dehydrogenase from Bacillus 
stearothetTtrophilus were generated by the oligonucleotide mismatch 
procedure of Winter et al. (Nature, 1982, 299, 756) in M13 with 
the mutagenic oligonucleotide as the primer for in vitro chain 
extensions. The double alanine replacement at 23 5 and 23 6 by 
glycine was obtained using the oligonucleotide sequence 
3'CGCGCTACCGCCGATGTTTA5' . The wild type and mutant enzymes were 
expressed in the PKK223-3 plasmid in coli (Barstow et al. , 
Gene, 1986, 46, 47) . 

Mutagenesis to construct Sac II and Xbal sites at either end of 
the ge ne coding for wild tvoe active site loop. 

A 54-mer oligonucleotide was used to direct mutagenesis 
to introduce unique restriction sites (SacII and Xbal ^ at either 
end of the active site loop (amino acids 98-110) using the wild- 
type template (Barstow loc. cit) . The mutagenic oligonucleotide 
was : 



5'GTCCACAAGGTCTAGACGCGTCTCGCCCGGTTTTTGGTTGGCGCCCGCGGTAATGACAAC3 ' , 



wo 93/15208 



PCr/GB93/00204 



22 

the annealing, chain extension and cloning were as described by 
Clarke et al. (Nature, 1986, 329., 699). 

5 Mutants were identified by making mini-preps and 

restricting with SacII and Xbal- Mutant mini-preps were 
restricted with EcoRI and Xhol and the small fragment was sub- 
cloned into PKK223-3 containing Ala235Gly, Ala236Gly mutant LDH 
from which the small EcoRI / XhoI fragment had been removed (Wilks 

10 &t ai- Biochemistry, 1990, 29,, 8587). The resulting plasmid 
(pLDHrs) was transformed into competent coli TG2 cells. The 
whole sequence was redetermined using a "Dupont Genesis 2000" 
automatic sequencer and showed the correct loop sequence had been 
inserted. The partial DNA sequences of the wild type gene and 

15 the mutant with inserted restriction sites are shown in Table 2 
below. 
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Table 2 Comparison of the protein and DNA sequences of b. 

stearothermophi Inc; lactate dehydrogenase in the loop (93-111) region 
of wild-type and the mutant with SacII and Xbal restriction sites at 
either end of the loop, and the variable loop sequences derived from 
5 them. 

Wild-type DNA sequence in loop region (Cys changed to Thr is shown 
bold) : 

' 10 LeuValVallleCysAlaGlyAlaAsnGlnLysProGlyGluThrArgLeuAsp 

5 ' TTGGTTGCTATTTGCGCCGGCGCCAACCAAAAACCGGGCGAGACGCGGCTTGAT3 ' 
3 ' AACCAACGATAAACGCGGCCGCGGTTGGTTTTTGGCCCGCTCTGCGCCGAACTA5 ' 

Mutant DNA (pLDHrs) sequence in loop region: 

15 

LeuValVallleThrAlaGlyAlaAsnGlnLysProGlyGluThrArgLeuAsp 

5 ' TTGGTTGCTATTACCGCGGGCGCCAACCAAAAACCGGGCGAGACGCGTCTAGAC3 ' 

3 ^AACCAACGATAAT GGCGCC CGCGGTTGGTTTTTGGCCCGCTCTGCGC AGATCT GS ' 

Bacll Xbal 

20 Mlul 

Two oligonucleotides (LLA and LLB) used to synthesise the big loop by 

25 5'TACCGCGGGCAACATTAAATTGCAACAAGATAA3' (LLA) 
SacII 

5 ' GGTCTAGACGATCGCCCGTCGGGTTATCTTGTT3 ' ( LLB ) 
Xbal 

3 0 Big loop sequence in the 97-110 region (note the Mlul site is 

destroyed) : 

CysAlaGlyAlaAsnGlnLys ProGlyGluThrArgLeuAsp (wild-type). 

ThrAlaGlv AsnlleLvsLeuGlnGlnAspAsnProThrGlvAsp AraLeuAsp (big loop) 
35 5 ' TACCGCGGGCAACATTAAATTGCAACAAGATAACCCGACGGGCGATCGTCTAGACC3 ' 
3 ^AT GGCGCC CGTTGTAATTTAACGTTGTTCTATTGGGCTGCCCGCTAGCA GATCTGG S ' 
Sacll Xbal . 

Oligonucleotides for PGR synthesis of LeuLysGly and SerLysGly short 

4 0 loops: 

SIiA 5 ' TACCGCGGGCGCCAACT3 ' 

SLB 5 ' GGTCTAGACGGCCTTTCAAGTTGGCGCC3 ' 

SLC 5 ' GGTCTAGACGGCCTTTGGAGTTGGCGCC3 ' 

45 

Short loop sequence in the original 97-111 region ( Mlul site is again 
destroyed) : 

GlvGluThr 

50 CysAlaGlyAlaAsnGlnLysProArgLeuAsp (wild-type) 
ThrAlaGlvAlaAsn LeuLvsGlv AraLeuAsp (SLl) 
5 ' TACCGCGGGCGCCAACTTGAAAGGCCGTCTAGACC3 ' 
3 ' ATGGCGCCCGCGGTTGAACTTTCCGGCAGATCTGG5 ' 

55 ThrAlaGlvAlaAsn SerLysGlv AraLeuAsp (SL2) 
5 ' TACCGCGGGCGCCAACTCCAAAGGCCGTCTAGACC3 ' 
3 ' AT GGCGCC CGCGGTTGAGGTTTCCGGCA GATCTG G5 ' 
SacII Xbal 
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PGR assembly method for generation of random 
combinational library of the loop region of the b. 
stearo'therrn onhi l iig LDH aenor 

1. Single-stranded oligos were made such that the oligos were 
only different to the wild-type sequence at positions encoding 
amino acids 101 and 102 where each one of the bases A, t; g 
has an equal chance of being inserted. (Oligo mix 101,102 
forward. ) 

2. An Mlul restriction site which is present in the wild-type 
template is destroyed by change of the third codon position of 
amino acid 108 from an ACG to an ACT without altering threonine 
as the amino acid being coded. The absence of the Mlul site 
enables verification that the mutants have been generated and to 
select against wild-type sequences. 

3. A DNA primer which has 14 base homology to olio mix 101, 102 
forward was used to make the complementary strand (oligo mix 
101,102 reverse) using a Klenow reaction. 

4. Single-stranded library oligos were used with primer 1 and 
5ng of wild-type template in order to generate a 3 00 base pair 
product with 25 cycles of PGR (94°C, for 1 minute, 55 *C for 1 
minute, 72 «C for 2 minutes). 

5. Double-stranded Klenow oligos were used with primer 2 and 
5ng of wild-type template to generate an 800 base pair product 
which overlaps the 300 base pair product. (PGR conditions as in 
4.) 

The use of double-stranded oligo as primer in 5 is very important 
in ensuring that both the 300 and 800 base pair products are made 
and primed using mutant oligos and that the wild-type sequence 
at position 101 and 102 is not copied. 

6. After gel purification, 20ng of the 300 base pair product 
and 60 ng of the 800 base pair product were mixed without primers 
and thermocycled seven times in order to join the fragments (94 °c 
for 2 minutes, 55 ^C for 1 minute, 72 ^'G for 4 minutes) . 

7. After seven cycles, primers 1 and 2 were added, and the 
product amplified for twenty cycles (94 °C for 1.5 minutes, 55 °C 
for 1 minute, 72«^c for 2.5 minutes). 

8. The 1 kb PGR product was then gel purified, digested with 
EcoRI, and gel purified again before ligation into EcoRl-cut 
PUC18 plasmid vector and transformation into coli. 



10 
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9. Recomhiinaut- colonies were selected for by IPTG and X-Gal 
insertional inactivation. 

10. Of the nine white colonies picked, seven were verified for 
the presence of the LDH gene and to resistance to Mlul digestion 
via gel and restriction analysis. The other two did not have 
inserts , 

11. Six of the mutants were sequenced using a Dupont 2 000 
sequencer and confirm that the random mutagenesis approach had 
been achieved. 

See Table 3 below: 
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Generation o f double-stranded DNA loop fraainents hy 
oliaonucleotide-overlap 

Each pair of overlapping oligonucleotides (20)LtM of 
each) were subjected to 30 cycles of annealing and extension 
(94«C for 1 minute, cool to 45«C for 2 minutes, 45^C for 1 
minute, heat to 72 °C in 1 minute, 72 °C for 1 minute in 50^1 
containing 0,05 M KCl, lOmM Tris pH 8 . 3 , 1.5 mM MgCl2, 0,01% 
gelatin), 200mM of each dNTP and 2,5 units TAQ DNA polymerase). 
The double-stranded DNA product was purified and then cut with 
-SacII and Xbal before ligating it into the plasmid pLDHrs cut 
with the same enzymes. The ligated products were restricted with 
Mlul to cleave wild-type plasmid pLDHrs. 

The DNA was purified, m'icrodialysed and used to 
transform coli TG2 cells by electroporation . Transformed 
cells were selected for ampicillin resistance. Ten such colonies 
were picked and plasmid DNA purified from overnight cultures. 
The presence of mutant loops was confirmed by resistance to Mlul 
digestion. 

The expression of the enzymes was obtained as described 

above. 

Purification o f lactate dehydrogenase and mutants 

Overnight cultures (11) were centrifuged and the packed 
cells were resuspended in 50 mM triethanolamine , pH 6.0. The 
cells were sonicated and the debris was removed by 
centrifugation. The protein in the supernatant was precipitated 
by the addition of 65% ammonium sulphate. The precipitate was 
spun down and resuspended in 50 mM triethanolamine, pH 6.0 and 
dialysed against the same buffer. After dialysis, NADH and FBP 
were added to the protein to final concentrations of 5mM and lOmM 
before loading onto an oxamate Sepharose column which had been 
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pre-equiiibrated with 50 niM triethanoloamine, pH 6.0, 3 mM KADH 
and 5 mM FBP. After washing off unbound protein with column 
buffer mutant LDH was eluted with 50 mM triethanolamine, pH 9.0, 
0.3 M NaCl. The elutant was precipitated with 65% ammonium 
sulphate and then resuspended in and dialysed against 50 mM 
triethanolamine, pH 7.5. The protein was then loaded onto a Q- 
Sepharose Fast Flow column and eluted with a salt gradient. LDH 
eluted at a concentration of 0.25 M NaCl. For the double glycine 
mutant enzyme, the first chromatography procedure with oxamate 
Sepharose was replaced by chromatography on Blue Sepharose -F3GA^ 
otherwise the procedure was essentially the same. All proteins 
were judged to be greater than 98% pure from the intensity of 
Coomassie blue staining on an SDS Phast gel (Pharmacia) . The 
yield of protein was usually 0.2g/l of original broth. 

Steady-state Kinetics 

Steady-state measurements were made by following the 
reduction in absorbance at 340nm in the NADH/NAD'^ conversion. 
All assays were at 25°C in the buffer Bis-Tris, pH 6, (20mM:) , 
containing KCl (50mM) and when used fructose-l, 6- bisphosphate 
at 5mM. Protein concentration was determined from the absorbance 
at 280nm using the value of 0.91 for img/ml protein in 1cm path 
and an Mr of 33,000. 

The results from these determinations are shown in 
Table 4 below. 



wo 93/15208 



29 



PCT/GB93/00204 



c 

Q) 
C 

o 

a 
o 
o 

0) 

s 
o 
w 

o 
c;) 

Q) 
CD 

m 
u 
m 



4J 
4-) 

cn 
>1 

fD 
Q) 



< 



a. 

I 



f-3 ID 

+ 



I 



o 
o 



ft, 
I 



a. 

CQ 
I 



04 
DO 

iS. ft. 

in CQ 
CM + 



I 

Q 
h3 



+ 



w 





in 








\D 


CO 






r-l 


o 




o 


r-t 


O 




o 


O 




O 




o 


CO 


• 




r-l 


* 


n 


no 


« 


o 






o 






o 






o 




in 






















VD 


















o 


















o 




o 


T— t 


in 






in 


\D 


CM 




o 










• 


(N 


rn 


• 








o 






o 




O 
























o 










iH 






o 


H 






o 


o 




o 


o 




o 




rs) 


1 






1 


• 




1 


• 














o 






o 






n 


















CO 


















IN 


o 


in 












in 


n 






GO 


\o 


o 


o 


o 


o 




o 








in 






r> 


• 


o 






o 










o 


in 












\£> 








o 


o 










O 










in 


1 






1 




n 


1 




o 






o 






o 




o 












o 


















o 


o 




CO 


r- 


o 


O 








r>j 


n 


O 


H 


H 





o 

VO 



o 
o 



in 
n 



Ex: 



O 

in w 
rs3 CM in 



o 
in 

CM 



KD W 
O fM 



to E 



< 
> 

a. 



2: 



00 
CO 



o 
n 



fx] 

in 



o 
fsj in 



n 
W 

^ in 



nj 
o 



00 

CM 



(TJ E 

« 's 



2; 

u 



o 
in 



I I 



• (M 
. in rH 
1-1 H tH 



I I t 

n 

« o 

o ID in 



o 



< 
O 

< 



o 

H 

o 

H 



W 

< 

O 

a; 
< 



in 

CM 



O 

in 



in 



n 



CO 

o 



n 
W 
in 



I i 



O fH 



U S 
ID B 

X 

a: 
I 
I 

o 
><; 
o 
I 

CM 



Cx3 
O 

H 
03 



n 

o o ly 
<N (M ,H 



o 
o 



o o 

rH CO 



O 

rM 



n 
W 

CO 



03 



O 



O 

o 



in 



cn 

CO 



00 

in VD 



n 
cn 





















W 




CO 




in 


(M 




n 


m 








in 


in 



















• 


CO 


CO 


CM 






n 




H 



to 
u 

J 
X 

w 
a: 

I 

o 

X 

o 
I 

CM 



E 



EH 
O 
< 

o 
oi 
0. 



m 

0 
4^ 



U 

c 
m 

JQ 
U 
O 

to 

XI 

ro 

ro 

-u 
w 
JQ 

CQ 

Q) 



QJ 

O 
4-> 

0) 

a 

O 
4J 
fO 
Jh 

o 
u 
m 

U] 
U) 
0) 



o 



o 
in 

cu 
> 
o 

rd 

(0 

cu 
0 
fi 
to 
> 



wo 93/15208 



PCr/GB93/00204 



30 

Reduction of 4-inethyl-2-oxo-3-pentenoic acid using MVS/GG: 

MVS/GG (6 units moles /minute/ 3 0 C) ) and yeast 

formate dehydrogenase (5 units) were added to a solution of 4- 
inethyl-2-oxo-3-pentenoic acid (1.0 mM) in deoxygenated Tris 
buffer (5inM:pH 6.0; SOml) containing NADH (0.02 mM) , sodium 
formate (3.1 mM) , fructose-l, 6-bisphosphate (0.4 mM) and 
dithiothreitol (0.08 mM) . The solution was stirred at room 
temperature (-20 °C) under nitrogen for 5 days with periodic 
addition of 0 . 2 mM HCl to maintain pH in the range of 6.0 - 6.2. 
Acidification to pH 2.0 and extractive work-up with ethyl acetate 
gave (S) -2-hydroxy-4-methyl-3-pentenoic acid in 91% isolated 
yield. Analysis of the (R)-MTPA derivative and comparison to a 
racemic standard gave a value of at least 99% for entantiomeric 
excess . 
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Claims: - 

1" A method for modifying the specificity and/or 

efficiency of an enzyme, while retaining its catalytic activity, 
characterised in that it comprises: selecting an enzyme, the 
tertiary structure of which is substantially known or deduced; 
identifying at least one specificity and/or efficiency-related 
region; identifying or constructing unique restriction sites 
bounding the identified region in the DNA coding therefore- 
generating a DNA sequence which corresponds to at least a portion 
of the identified region, except that the nucleotides of at least 
one codon are randomized, or selecting as a substitute for at 
least a portion of the identified region an alternative such 
region, which may itself be similarly randomized; using the 
generated or substitute DNA sequence to replace the original such 
sequence; expressing the DNA including the generated or 
substitute DNA sequence; and selecting for a desired modification 
so that the DNA coding therefor may be isolated. 

^- A method as claimed in claim 1 wherein the enzyme 

selected is a dehydrogenase. 

^' A method as claimed in claim 2 wherein the 

dehydrogenase is an a-hydroxy acid dehydrogenase. 

^' A method as claimed in any of claims 1 to 3 wherein a 

loop region of an enzyme is identified. 

^' A method as claimed in any of claims 1 to 4 wherein the 

randomized DNA is generated by means of an inosine triphosphate 
PGR method or a spiked oligonucleotide method or a PGR assembly 
method. 

^* A method as claimed in any of claims r to 5 wherein the 

selected substitute is based on a corresponding sequence from a 
similar enzyme. 
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7. A method as claimed in any of claims 1 to 6 wherein the 
generated or substitute DNA is cloned into a plasmid or phage 
vector and transformed into a bacteria or virus for expression. 

8. A method as claimed in any of claims 1 to 7 wherein the 
enzyme is L-lactate dehydrogenase, positions 101 and 102 having 
been randomized. 

9. A process for the production of a chiral product 
characterised in that it comprises the use of an enzyme which has 
been modified by a method as claimed in any of claims 1 to 8 . 

10. A process as claimed in claim 9 wherein a cof actor 
recycling system is provided. 

11. A process for the reduction of 2-oxo-4-phenyl-propanoic 
acid characterised in that it comprises the use of L-lactate 
dehydrogenase, which has been modified in the loop region by a 
method as claimed in any of claims l to 8. 

12. A process for the reduction of 4-methyl-2-oxo-3- * 
pentenoic acid characterised in that it comprises the use of 
MVS/GG obtainable by a method as claimed in any of claims - l to 
8. 
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